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Abstract — In this paper, an outlier elimination algorithm for 
ellipse/ellipsoid fitting is proposed. This two-stage algorithm 
employs a proximity-based outlier detection algorithm (using the 
graph Laplacian), followed by a model-based outlier detection 
algorithm similar to random sample consensus (RANSAC). These 
two stages compensate for each other so that outliers of various 
types can be eliminated with reasonable computation. The outlier 
elimination algorithm considerably improves the robustness of 
ellipse/ellipsoid fitting as demonstrated by simulations. 

I. Introduction 

Although there are many off-the-shelf algorithms for ellipse 
and ellipsoid fitting (e.g., Q (2 0), there are some lim- 
itations when applying these algorithms to data with many 
highly noisy observations (outliers). Therefore, before fitting 
the data, it is desirable to eliminate the outliers first. To this 
end, we classify a group of unlabeled data into two classes. 
One of them consists of data that can be fit well by an ellipse 
(ellipsoid), and the other consists of data that can be classified 
as outliers. In other words, it is a two-class classification 
problem with prior knowledge on one of the classes. 

As a well-developed area, outlier detection has been studied 
from several perspectives, such as statistics, neural networks 
and machine learning (4). However, due to the special proper- 
ties of the ellipse fitting problem, it is rather difficult to solve 
using merely one type of outlier detection algorithm. 

One difficulty is that the two classes, i.e. outliers and inliers, 
usually intertwine in a complex manner, as shown in Fig. [T] 
Some of the outliers are far from the ellipse (like point A), yet 
others may fall inside the ellipse (like point B). Many outlier 
detection algorithms based on proximity (e.g., fc-NN [5 ]) may 
not be effective for detecting the B-type outliers. 

On the other hand, with a high percentage of outliers, if we 
resort to model-based outlier detection algorithms like random 
sample consensus (RANSAC) |6], which can better handle the 
B-type outliers, the large number of parameters of the model 
looms as a serious problem. On defining w as the portion of 
the inliers, n as the minimum number of data points needed 
to fit a model, and p as the probability of successfully finding 
the correct model after running RANSAC k times, we have 
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the following relationship: p = 1 — (1 — w n ) k . When w is 
small (the percentage of outliers is high), we need k to be 
large to have a sufficiently high p. Unfortunately, the increase 
in k with respect to n is exponential. If we assume w = 0.5, 
to guarantee p = 0.99, we need k = 16 for a simple two- 
dimensional straight line model; yet for ellipse fitting, where 
n = 5, we need k = 146. For an ellipsoid (n = 9), we need 
k = 2356. Moreover, when n is large, the fitting algorithm 
itself becomes rather expensive to run even once. 

To counter these problems, in this paper we develop a two- 
stage hybrid outlier detection algorithm for ellipse/ellipsoid 
fitting. First, we employ an outlier detection algorithm based 
on algebraic graph theory (7), using proximity information 
of the data points. Second, after reducing the portion of the 
outliers among the remaining data set, we employ a robust, 
model-based outlier detection algorithm to efficiently refine 
the results. In the first stage, a large portion of the most distant, 
isolated outliers, which can be a great hazard to model-based 
outlier detection methods, are detected. In the second stage, 
the subtler, closer outliers that go against the ellipse model are 
detected, and some of the misclassified inliers are recovered. 
These two stages compensate for each other to form a more 
efficient and accurate outlier detection algorithm. 

The structure of the paper is as follows: In Section II, the 
model is specified and the three major steps, constructing 
the graph Laplacian and choosing the parameters, detecting 
outliers in the eigenvectors, and refining the final results using 
RANSAC, are described. In Section III, simulation results of 
outlier elimination for both ellipse and ellipsoid fitting are 
described. Section IV concludes our work. 

II. Model and Method 

A. Model: Basic Assumptions 

First, we need to specify the model by stipulating a few 
assumptions on the data. In our description, we restrict to 
the case of ellipse fitting, since it is easy to generalize to 
ellipsoid fitting. Assume that we have a total number of K 
points {(xi, Vi)}fLi with N points from an ellipse with small 
amounts of noise (inliers) and M points randomly scattered 
in the plane (outliers). We do not assume a concrete statistical 
model for the outliers (which is usually hard to do in practice), 
yet we assume the following: 



1) The average distances between inliers are smaller than 
those between inliers and outliers; 

2) The inliers are the majority (> 50%) of the data set; 

3) The inliers arise from an ellipse and their noise level 
is low enough so that it is possible to fit a good model 
based on a small portion of these inliers (ideally 5 points 
for the ellipse case). 



might miss many outliers, while for small t, the graph tends 
to be separated into many weakly connected blocks, and we 
are more likely to misclassify many inliers as outliers. 




Fig. 1. Two different types of outliers. Proximity-based methods are good at 
detecting outliers similar to point A, while model-based methods can better 
detect outliers similar to point B. 

With the first two assumptions, it is possible, by properly 
selecting the radius of connection, to construct a graph based 
on proximity in which the inliers are strongly connected to 
each other, while being weakly connected to the outliers. 
Then, based on some basic concepts of algebraic graph the- 
ory, we can convert the outlier detection problem in a two- 
dimensional space into a simple one-dimensional real-valued 
outlier detection problem among the elements of eigenvectors. 
Moreover, with the third assumption, we can apply a model- 
based algorithm to refine the detection results of the proximity- 
based algorithm. 

B. Constructing Graph Laplacian 

Generally speaking, proximity-based outlier detection algo- 
rithms depend on an adjacency graph constructed by connect- 
ing adjacent points. According to algebraic graph theory, the 
connectivity information is reflected in the graph Laplacian 
and its corresponding eivenvalues and eigenvectors. 

In our algorithm, instead of using an unweighted graph with 
a hard connection rule, the graph Laplacian is constructed 
using heat kernels in the following manner: 

1) Form an adjacency matrix Q with Qij being the Eu- 
clidean distance between the points yi) and (xj,yj)\ 

2) Form a fully connected adjacency graph with edge 
weight Wij = e~ Qi ^ f (heat kernel); 

3) Construct the graph Laplacian matrix: L = D — W, 
where D is a diagonal matrix with its diagonal entries 
as column sums (or row sums) of W; 

4) Compute eigenvalues and eigenvectors for the general- 
ized eigenvector problem, Li = \Df. 

Then, the set {A|A w 0, A G and its corresponding 

eigenvectors form the cornerstone of our proximity-based 
outlier detection algorithm. 

The success of the proximity-based outlier detection highly 
depends on the choice of the "radius of connection" (i.e., y/t in 
the heat kernel). Thus, the value of t is of essential importance. 
Different choices of this radius can result in graphs that differ 
drastically from each other, as shown in Fig. [2] For large 
t, the vertices tend to be strongly connected and hence we 



Fig. 2. Different choices of connection radius lead to different results. The 
figure on the left depicts the situation where the radius is so small that even 
the inliers do not connect sufficiently with each other. The figure on the right 
illustrates the case where the radius is so large that almost all the data points 
are connected with each other, and thus makes it impossible to distinguish 
inliers from outliers. The figure in the middle shows that with a properly 
selected radius of connection, the inliers form a large connected component, 
leaving most of the outliers unconnected. 

The choice of t is a rather delicate problem. Here we present 
an intuitive empirical rule to determine a good value of t. 
Sort all the elements of Q in ascending order and denote the 
resulting vector as q. Then take the pK-th element of q as y/t. 
Intuitively, most of the small elements of Q are composed of 
the distances between inliers, according to our assumptions. 
By taking the pK -th element of q as y/t, we approximately 
guarantee a strongly connected graph among the inliers of 
average degree of at least p — 1, if we assume e _1 is the 
threshold for a "connection". Empirically, p = 4 works well. 

C. Selecting Outliers in Eigenvectors 

Given an appropriate t, the inliers form a strongly connected 
component, leaving most of the outliers loosely connected 
to it and each other. As a result, with a proper interchange 
of rows and columns, the adjacency matrix Q, and thus the 
Laplacian matrix L, is close to a block diagonal matrix. Each 
block corresponds to a strongly connected component, with 
the largest one corresponding to the inliers. 

For each block, there is an eigenvalue approximately equal 
to zero, and the corresponding eigenvector is almost a binary 
vector composed of 0's and l's, with only the elements aligned 
with the block being l's. Here, we assume that the L°° norms 
of all the eigenvectors are normalized to 1. Among all the 
approximate binary eigenvectors that correspond to the close- 
to-zero eigenvalues, some contain more l's than others. The 
key step of outlier detection is to eliminate those data points 
that correspond to the non-zero elements of binary eigenvec- 
tors with very few l's, because the components composed of 
these points are very weakly connected to other components, 
and thus are more likely to be outliers. 

Therefore, finding outliers is approximately equivalent to 
finding protruding l's in the eigenvectors with close-to-zero 
eigenvalues. Thus, we have reduced a problem of high- 
dimensional outlier detection with a complex hypothesis into 
a one-dimensional outlier-detection problem. However, since 
the eigenvectors that correspond to close-to-zero eigenvalues 
are not perfectly binary, more elaborate methods need to be 
employed to detect outliers. 



First, eligible eigenvectors (approximate binary eigenvec- 
tors) are selected from the collection of eigenvectors. Specifi- 
cally, we take the eigenvectors that correspond to eigenvalues 
less than 0.1 as the candidates. Then, keeping only the 
binary eigenvectors, the "high frequency" eigenvectors are 
excluded. Note that f is a "high frequency" eigenvector if 
(Ejl f il - IEj f jl)/Ej l f jl is sufficiently large, i.e. those 
eigenvectors with both large positive and negative elements. 
After that, the one-dimensional outlier detection algorithm is 
employed. For each eligible eigenvector: 

1) Randomly choose half of the elements, and assign them 
as inliers; 

2) Find the 25% quantile median /i, and 75% quantile 
a 3 / 4 of the selected elements; 

3) Create an interval / = [/i— 7(^ — 0^1/4), /i +7(0^3/4-/2)] 
(usually select 7 « 2 ~ 3), and test all the elements: if 
the elements fall into /, then reclassify them as inliers, 
otherwise, reclassify them as outliers; 

4) Repeat Step 2 and Step 3 until the classification of 
outliers and inliers does not change; 

5) Output the selected outliers. 

The algorithm above does not guarantee the "correct" result. 
It can be helpful to run it several times and choose the 
result with minimum intra-class deviation (as in the case of 
RANSAC). However, in our simulations, we ran the routine 
only once, which was good enough for our data. 

D. Refine the Results Using RANSAC 

The algorithm described in the previous two sections uses 
only the proximity information of the data points. However, 
it is incapable of detecting type-B outliers in Fig. [T] and may 
easily lead to errors when the outliers are close to the inliers, 
even if they deviate conspicuously from the ellipse model. 

To eliminate type-B outliers, we need to use the additional 
prior knowledge that the inliers are located on an ellipse. 
We can achieve this goal by running an algorithm similar 
to RANSAC, which has been shown to be inappropriate 
when the percentage of outliers is high and the number of 
parameters in the model is large. However, since we have 
greatly decreased the percentage of outliers in the remaining 
data set by employing the outlier detection algorithm based on 
proximity, it is now feasible to run a RANSAC-type algorithm. 

To make our algorithm more efficient, we slightly modify 
the vanilla version of RANSAC (6). First, we use the entire set 
of inliers selected by the first stage algorithm to fit an initial 
model, instead of randomly choosing the minimum number 
of points as in the original RANSAC, since the remaining 
outliers represent just a small percentage and are close to the 
inliers. Moreover, since our initialization is not random, it is 
unnecessary to run RANSAC repeatedly. We summarize the 
revised model-based outlier detection algorithm as follows: 

1) Fit a model h to the "inliers" selected by the proximity- 
based outlier detection algorithm; 

2) Test all the data points with respect to h, classify the 
points that saliently deviate from h (above a threshold) 
as outliers and then classify other points as inliers; 



3) Refit model h based on the updated inliers; 

4) Repeat step 2 and step 3 until the classification does not 
change any further. 

With this model-based outlier detection algorithm, we are 
able to detect most of the missed outliers in the first stage 
(usually harder ones very close to the inliers), and also clear 
the labels for misclassified inliers. 

III. Simulation Results 

A. A Typical Ellipse Fitting with Outlier Elimination 

It is of interest to inspect a typical simulation for outlier 
detection of an ordinary ellipse to see the performance of our 
two-stage algorithm. In the simulation, we have N = 100 
inliers and M = 50 outliers. The variance of the independent 
Gaussian additive noise of inliers in both x and y directions 
is do = 0.01, and we simply model the outliers as data points 
added by an unusually large Gaussian noise, with <j\ = 2. 
The true ellipse has an eccentricity e = 0.95 with semi-major 
length a — 5, and takes the standard position (centered at 
the origin, with semi-major axis aligned with the x axis). The 
outlier detection results are shown in Fig. [3] 



Fig. 3. Outlier detection results for ellipse fitting. The circled data points are 
classified as outliers by the algorithm. The figure on the left demonstrates the 
detected outliers after the proximity-based algorithm, where there are several 
missed outliers close to the ellipse, while a few inliers are misclassified as 
outliers. The figure on the right shows that these mistakes are corrected by 
the model-based algorithm. 

It is interesting to notice that the first stage of our algorithm 
makes several mistakes, with several outliers missed (the 
ones inside and closely outside the ellipse) and a few inliers 
misclassified (several circled points on the perimeter of the 
ellipse). These mistakes are corrected by the second- stage 
model-based algorithm. And the final result leaves us a group 
of purged, low-noise points from an ellipse. 

B. Improvement and Comparison 

It is worthwhile to compare the performance of fitting 
procedures with and without outlier elimination for different 
numbers of outliers. Here, we choose N — 100, M from 1 
to 55, do = 0.1, and o\ = 3. The true model for the ellipse 
remains the same. The comparison results are shown in Fig. 
[4] on the left. We measure the errors using the non-overlapped 
area of the fit and true ellipses (normalized by the area of the 
true ellipse) in the simulation. And the curves demonstrate 
the average performance of the two approaches. It is worth 
pointing out that for most of the cases, the performance of the 
fitting algorithm with outlier detection does not deteriorate as 
the number of outliers increases, indicating an almost perfect 
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Comparison of Our Method with Direct RANSAC 
— Ellipse Fitting with Our Method I 
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Fig. 4. Left: fitting errors with and without outlier elimination. Right: 
comparison between the fitting error of our algorithm and that of RANSAC. 



D. Generalization to 3-D: Ellipsoid Fitting 

The basic set up for the ellipsoid fitting simulation is as 
follows: for inliers, N = 300 and cfq = 0.1; for outliers, 
M = 50 and g\ = 5; the ellipsoid takes the standard position, 
with semi-axis lengths a = 5, b = 4, and c = 3. The outlier 
detection results are as shown in Fig. [6] 



r 



detection rate of the outliers. On the other hand, without outlier 
detection, the fitting results are not acceptable even when the 
number of outliers is less than 10. 

We also compared our algorithm with simple RANSAC 
l6l . as shown in Fig. [4] on the right. We chose o\ = 5, and 
varied the number of outliers M from 1 to 90. Our algorithm 
performs consistently well until the percentage of outliers 
reaches 50%; on the contrary, the performance of RANSAC 
algorithm with 1000 iterations, though good for the cases with 
a small number of outliers, deteriorates rapidly as the total 
number of outliers increases. Our algorithm demonstrates more 
robustness in the presence of large numbers of outliers. 

There is another robust algorithm, Five Point Fit Ellipse 
Fitting [8], where all five-tuples of points are selected and 
fit by ellipses, the median of the parameters of which being 
the final fitting result. In terms of robustness, this algorithm 
performs competitively with our algorithm; however, it turns 
out to be so much more intensive in computation that it can 
hardly be implemented in a reasonable period of time. 

C. Robustness with Different Types of Outliers 

Because our outlier detection algorithm is a hybrid, it is able 
to tackle a broad variety of outliers and the fitting error can 
be bounded. Here we run our algorithm for different types of 
outliers by adjusting the noise level of outliers g\ G [0.1, 1.2], 
with N = 120, M = 90, a = 0.1, and the true model for the 
ellipse unchanged. The interesting result is shown in Fig. [5] 
where for outliers closer to the inliers as well as ouliers distant 
from the inliers, our algorithm performs consistently well, with 
fitting error tightly bounded below a low level. This shows the 
strong robustness of our scheme. 

Ellipse Fitting Error under Different Outlier Noise Level 



— Ellipse Fitting with Outlier Detection 



Outlier Noise Level a 



Fig. 5. The performance of our two-stage algorithm under a variety of outlier 
noise levels. Our algorithm performs consistently well for outliers with very 
different magnitudes. 



Fig. 6. Outlier detection results for ellipsoid fitting. Similarly to the ellipse 
fitting case, the proximity-based algorithm eliminates most of the distant 
outliers, yet misclassifies some inliers as outliers, with several close-by outliers 
missed, as illustrated on the left figure. These errors are corrected by the 
second stage model-based algorithm, as shown in the right figure. 



IV. Conclusion 

Proximity-based outlier detection algorithms are good for 
cases in which the outliers are wildly contaminated and large 
in number. However, such algorithms work poorly for outliers 
that are close to the inliers, even though these outliers are 
obviously not consistent with the model; on the contrary, 
model-based algorithms are very good at detecting a small 
portion of outliers that are not consistent with the model, yet if 
the percentage of outliers is high and the number of parameters 
for the model is large, the implementation of model-based 
algorithms can be costly. In the problem of ellipse/ellipsoid 
fitting with many outliers, by combining these two types of 
algorithms, we have found a promising method that performs 
robustly with high accuracy for a variety types and numbers 
of outliers. We expect that this method can be generalized to 
other similar fitting problems, with large numbers of outliers 
and models with many parameters. 
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